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n  Servers  (WAIS) 


By  Eliot  J.  Christian  and  Timothy  L,  Gauslln,  ISD,  Reston 


Wide  Area  Information  Servers 
(WAIS,  pronounced  "ways") 
address  a  fundament  a]  problem: 
Haw  can  you  find  and  retrieve 
the  data  and  information  you 
need  when  you  don't  know  what 
is  available,  where  to  find  the 
information,  or  haw  to  access  it? 

There  are  many  information  ser- 
vices  and  data  base  products  to  help 
users  find  and  retrieve  information, 
but  most  are  targeted  to  specialists  or 
require  the  user  to  understand  how 
the  data  is  organized.  Because  most 
researchers  gather  information  from  a 
variety  of  sources,  their  frustrations  in 
dealing  with  complex  and  divergent 
approaches  may  be  inhibiting  growth 
in  the  information  services  market. 
Dow  Jones  News  Service  and  Peat 
Marwick  teamed  with  Apple  Com- 
puter and  Thinking  Machines  to  de- 
velop a  facility  whereby  users  with 
limited  computer  skills  could  access 
personal,  corporate,  or  published  in- 
formation from  one  interface — WAIS. 

WAIS  is  developing  in  the  context 
of  an  explosive  growth  of  digital  in- 
formation, especially  as  the  definition 
of  information  expands  beyond 
alphanumeric  to  graphics  and  multi- 
media (e.g,  audio,  music,  video).  Yet, 
network  connectivity  through  utili- 
ties* such  as  the  Internet  is  also  grow- 
ing explosively,  as  is  the  availability  of 
computing  power  ranging  from  ubiq- 
uitous desktop  workstations  to  mas- 
sively parallel  supercomputers. 
Although  conventional  brute  force 


methods  are  capable  of  searching 
sources  containing  several  trillion 
characters  of  text  and  data,  better 
techniques  are  needed  to  make  such 
searches  quick  and  affordable. 

An  Op§n  Standard 

In  the  late  1980's,  the  information 
services  community  was  completing  a 
crucial  piece  of  work—a  standard 
known  as  the  Information  Retrieval 
Service  Definition  and  Protocol  Spec- 
ification (Z39J0),  The  importance  of 
open  standards'' in  a  product  such  as 
WAIS  cannot  be  overemphasised.  Be- 
cause a  single  computer-to-computer 
protocol  is  used,  information  can  be 
stored  anywhere  on  different  types  of 
machines,  and  a  single  interface  can 
retrieve  information  from  all.  As  the 
standard  gains  wide  acceptance,  pro- 
prietary access  mechanisms  will  give 
way  to  competition  On  the  content  of 
information  sources  while  allowing 
&dd-on  features  for  special  interest 
communities. 

The  239,50  standard  is  a  product  of 
the  National  Information  Standards 
Organization  (NISO),  accredited  to 
the  American  National  Standards  In- 
stitute. NISO  239.50  is  fully  compati- 
ble with  the  NISO  standard  for  library 
catalogs  (Z39.2)  originally  promul- 
gated by  the  Library  of  Congress  and 
known  as  MARC  (Machine  Readable 
Cataloging).  Both  239.50  and  Z39.2 
have  corresponding  International 
Standards  Organization  (ISO)  stan» 
dards.  239.50  is  an  applications  ser- 
vice within  the  family  of  ISO 
standards  comprising  the  Open  Sys- 
tems Interconnection  model.  It  is  also 
fully  compliant  with  TCP/IP  as  im- 


plemented on  the  international  net- 
work of  networks  known  as  the 
Internet. 

Adoption  of  the  Z39.50  standard 
protocol  throughout  the  international 
library  community  fits  in  well  with 
the  WAIS  goal  of  making  information 
searches  coherent  across  different 
services.  WAIS  implemented  the  1988 
version  of  the  239.50  protocol  with 
extensions  that  are  now  reflected  in 
the  1992  version  of  239.50.  The  1992 
version,  currently  in  formal  review, 
has  a  built-in  mechanism  to  assure 
that  implements tions  intemperate  ex- 
actly as  the  standard  requires. 
Computer-to-computer  interchanges, 
whether  components  of  the  239.50 
protocol  or  of  the  content  being  deliv- 
ered, are  precisely  represented  in  a 
standard  computer  language  known 
as  Abstract  Syntax  Notation. 

Client  Server  Interaction 

WAIS  implements  239.50  in  a  cli- 
ent/server mode  of  computer  inter- 
action. In  a  typical  search  for  textual 
information,  the  client  software 
prompts  the  user  to  select  which  in» 
formation  sources  to  include  in  the 
search  and  to  enter  a  search  request. 
Once  the  search  request  is  entered  by 
the  user,  the  client  software  converts 
the  search  words  to  the  standard  in- 
formation retrieval  protocol  (Z39.50) 
and  presents  the  search  request  in 
turn  to  each  server  associated  with  a 
selected  source.  The  server  software 
takes  the  words  and  matches  them  to 
the  contents  of  all  documents  in  each 
selected  source.  The  client  software 
receives  search  results  from  all  of  the 
servers  and  presents  to  the  user  a  list 
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of  aU  document  titles  found.  When 
requested  by  the  user,  the  client  soft- 
ware requests  the  server  to  pass  the 
full  contents  of  the  document,  and  the 
client  then  presents  the  document  to 
the  user, 

Documant  Scoring 

Up  to  this  point,  the  Z39.50  cli- 
ent/server interaction  is  fairly  con- 
ventional, and  many  information 
service  providers  will  fit  their  current 
products  into  this  model.  One  very 
useful  feature  of  Z39. 50 is  that  the  user 
*ees  documents  listed  in  a  ranked 
order  based  on  relative  scores  as- 
signed to  documents  by  the  servers. 
The  algorithm  employed  for  the  scor- 
ing of  documents  will  be  a  fascinating 
area  of  development.  In  a  sense,  the 
scoring  algorithm  is  where  judgment 
is  applied  about  the  likelihood  that  a 
particular  document  will  be  seen  as 
relevant  by  the  user.  The  public  do- 
main version  of  W  AIS  has  a  fairly  sim- 
plistic  scoring  algorithm  which 
stresses  the  frequency  of  occurrence 
of  the  searched  for  words.  It  does  in- 
clude differential  weighing  for  where 
in  the  document  the  words  occur  and 
whether  the  words  occur  so  often  as 
to  be  non-specific, 

Rtquests  In  English 

A  critical  issue  in  searching  for  in- 
formation is  that  the  results  of  the 
search  can  be  compromi  sed  if  the  user 
lacks  the  skills  needed  to  use  the  client 
software  to  specify  the  search.  Be- 
cause the  WA1S  goal  is  to  totally  avoid 
the  usual  kinds  of  query  languages,  a 
search  request  in  WAIS  is  simply  en- 
tered by  the  user  in  English,  The 
WA15  software  does  not  analyze 
grammatical  structures  or  the  mean- 
ings of  the  words.  It  simply  depends 
on  the  fact  that  the  scoring  of  docu- 
ments will  tend  to- emphasise  words, 
primarily  nouns,  that  are  more  spe- 
cific. Conversational  embellishments 
will  tend  to  get  low  scores  since  they 
are  unlikely  to  be  of  specific  value  in 


the  target  documents.  While  the  sim- 
plicity of  the  W  AIS  search  request  has 
a  certain  appeal,  real  users  cannot  be 
expected  to  bring  with  them  all  of  the 
key  words  needed  to  find  everything 
that  may  be  relevant.  WAIS  therefore 
provides  an  elegant  mechanism  to  re- 
fine the  search  so  that  the  session  con- 
verges on  meaningful  results — an 
approach  known  as  "relevance 
feedback." 

Rnl»vanc#  Feedback 

A  W AJS  information  search  and  re- 
trieval session  evokes  the  experience 
of  using  a  library,  A  library  user  may 
begin  by  consulting  a  card  catalog  or 
index  or  by  asking  a  reference  librar- 
ian for  help.  At  this  point,  the  user  is 
searching  for  documents  based  on  a 
few  key  words  (e.g.,  subject,  title)  or 
names  (e.g.-;  author).  As  the  user  re- 
views the  documents  found,  he  or  she 
may  note  other  key  words  or  names 
that  could  lead  to  additional  relevant 
documents,  A  feedback  situation  de- 
velops as  the  user  modifies  subse- 
quent searches  based  on  results  found 
in  prior  searches.  Ideally,  the  user 
stops  searching  when  all  the  most  rel- 
evant documents  are  found, 

-  In  WAIS,  any  retrieved  document 
or  part  of  a  document  that  seems  to  be 
getting  d  oser  to  wha  t  the  user  is  inter- 
ested in  may  be  highlighted.  When 
the  user  designates  that  highlighted 
portion  as  relevance  feedback,  WAIS 
treats  the  words  in  that  portion  as  an- 
other search  request.  This  ability  to 
have  the  user  refine  the  search  itera- 
tively  through  relevance  feedback  is 
being  addressed  in  the  1992  version  of 
Z39.50.  Some  WAIS  developers  see 
relevance  feedback  as  crucial  to 
achieving  the  goal  of  eliminating 
complicated  query  languages. 

WAIS  Servers 

WAIS  information  servers  are  reg- 
istered to  the  Directory  of  Servers  cur- 
rently maintained  on  the  Internet  by 
Thinking  Machines.  Registration  re 


quires  a  commitment  to  keep  the 
server  in  a  reliable  operational  mode. 
The  registration  entry  includes  text  In- 
formation about  the  contents  of  the 
sources  reachable  through  the 
server — this  information  is  itself  in- 
dexed for  searching,  (Expected  m  the 
1992  version  of  Z39.50  is  guidance  on 
standardizing  the  mechanism  that  al- 
lows clients  to  learn  about  the  infor- 
mation sources  on  a  server,)  Also 
listed  is  information  that  will  be  used 
by  the  client  software  to  contact  the 
server  (e.g.,  TCP/IP  node  name),  as 
well  as  information  on  what  and  how 
to  pay  charges  for  use  of  the  server  if 
it  is  not  free. 

Any  server  capable  of  responding 
to  239.50  information  retrieval  re- 
quests can  be  an  information  server. 
Information  servers  can  be  local  (on 
the  workstation  or  local  area  network) 
as  well  as  remote  (accessible  now  via 
TCP/IP  and  X.25  networks  or  asyn- 
chronous dial-up  access  in  the  future). 
WAIS  does  not  require  any  central 
coord  ina  Hon  urd  ess  the  server  is  to  be 
advertised  through  the  Directory  of 
Servers.  In  fact,  an  information  server 
registered  to  the  Directory  of  Servers 
can  itself  act  as  a  subordinate  direc- 
tory of  servers  administered  locally, 
By  describing  sources  under  various 
directories  of  servers,  it  is  possible  to 
organize  the  sources  in  whatever  rela- 
tionships make  sense  and  yet  allow 
users  to  search  as  many  sources  as 
desired. 

One.  feature  of  WAIS  that  is  al- 
lowed but  not  required  by  the  Z39.50- 
1988  protocol  is  that  the  client/ server 
interaction  is  "stateless."  At  the  appli- 
cation level,  each  request  from  the  cli- 
ent to  the  server  is  a  separate  process 
■  that  is  not  associated  to  any  previous 
request,  The  server  does  not  maintain 
information  about  the  client  between 
requests.  (For  efficiency,  the  commu- 
nications link  itself  may  be  retained 
across  requests.)  This  feature  is  signif- 
icant for  situations  in  which  a  user 


DEVELOPMENTS— April  1992 


93 


i-)(-i4/unr 


Data  Management 


This  artela  was  rtprlnted  from  mi  April  1992  mw  of  information  systems  Divilopnunts. 


may  want  to  search  hundreds  of 
sources  on  dozens  of  servers  at  £ 
sitting, 

Information  Source* 

Information  servers  exist  to  pro- 
vide access  to  the  information  sources 
placed  on  them.  These  sources  are 
compilations  that  may  include  a  vari- 
ety of  formats.  Such  formats  are 
known  as  "document  types/' 
although  information  need  not  be  tex- 
tual. While  all  Z39.30  clients  and  serv- 
ers support  search  and  retrieval  of 
textual  information,  support  for  other 
document  types  that  may  have  been 
registered  in  239.50  (e.g.,  MARC  bib- 
liographic  format,  graphics, 
hypertext,  video)  is  negotiated  when 
the  client  initiates  its  relationship  with 
a  server. 

When  sources  are  created,  defining 
the  document  types  allows  the  server 
to  use  the  appropriate  translation  be- 
tween the  specific  query  format  of  the 
source  and  the  239.50  protocol.  The 
public  domain  WA1S  package  in- 
cludes assistance  in  creating  informa- 
tion sources.  Indexing  software  is 
provided  in  the  WAIS  package  for 
several  common  document  types  con- 
sisting of  text,  graphics,  and  biblio- 
graphic references  in  MARC  Source 
code  in  the  C  programming  language 
is  provided  for  adding  other  docu- 
ment types.  If  access  to  other  data 
Structures  is  required,  the  server  inter- 
face routines  are  also  designed  to  be 
customized.  A  typical  customization 
would  be  to  use  search  requests  to 
access  a  relational  data  base  such  as 
Ingress  Or  Oracle  using  Structured 
Query  Language  (SQL). 

Existing  and  Propositi  WAIS 
Implsmantatlons 

There  are  hundreds  of  information 
servers  registered  in  the  Directory  of 
Servers,  The  following  list  illustrates 
the  variety  of  uses  to  which  WAIS  is 
being  applied.  At  present,  there  is  no 
charge  for  using  these  servers. 


a  Thinking  Machines  maintains  the 
Directory  of  Servers,  WAIS  docu- 
mentation and  source  code, 
Frequently  Asked  Questions,  some 
patents,  molecular  biology 
abstracts,  a  cookbook,  the  Central 
Intelligence  Agency's  World 
Foctbook,  and  weather  maps  and 
forecasts.  ■ 

•  Massachusetts  Institute  of  Tech- 
nology maintains  a  server  of  classi- 
cal and  modem  poetry, 

®  Cosmic  maintains  descriptions  of 
government  software  packages, 

«  Library  of  Congress  is  creating  a 
WAIS  server  for  their  card  catalog. 

"  Columbia  Law  Library  maintains 
abstracts  of  legal  decisions. 

»  National  Institutes  of  Health  pub- 
lishes announcements  of  opportu- 
nities for  research  grants, . 

«>  The  High  Performance  Computing 
and  Communications  Initiative  is 
looking  at  applying  WAIS, 

•  Apple  has  created  a  dynamic  ver- 
sion of  WAIS  that  periodically  re* 
trieves  requested  information  in  an 
unattended  mode  and  presents  to 

-  the  user  a  "personalized  news- 
paper" showing  the  latest  informa- 
tion as  it  is  published. 

The  range  of  subject  matter  that  can 
be  covered  by  WAIS  could  be  viewed 
as  a  blessing  or  a  curse  depending  on 
whether  the  user  needs  a  very  broad 
or  very  narrow  search.  As  in  any  pub- 
lishing medium,  there  is  also  likely  to 
be  a  wide  range  in  the  credibility  of 
information  across  sources.  WAIS  de- 
velopers are  considering  whether 
sources  might  carry  endorsements 
that  they  would  earn  through  peer 
review,  in  the  manner  of  respected 
science  journals. 

WAIS  Applications  in  the  usgs 

USG5  involvement  with  WAIS 
stems  from  the  efforts  of  the  Informa- 
tion  Systems  Division  (ISD)  to 
enhance  the  current  version  of  the 


Earth  Science  Data  Directory  (ESDD). 
ESDD  is  maintained  as  a  useful  source 
for  references  to  earth  science  data, 
including  many  at  the  state  level,  and 
a  comprehensive  list  of  data  holdings 
relevant  for  arctic  research.  WAIS  is 
especially  appropriate  for  that  appli- 
cation, because  the  ESDD  user  com- 
munity ranges  from  local  citizenry  to 
international  global  change  research- 
ers. The  ability  of  WAIS  to  place  the 
ESDD  in  the  context  of  other  USGS 
and  external  information  sources 
would  be  especially  powerful  ior 
these  users.  One  idea  being  consid- 
ered is  to  publish  in  the  WAIS  Direc- 
tory of  Servers  a  subordinate 
directory  of  servers  focused  on  earth 
science  data  and  information.  That  di- 
rectory would  be  maintained  by  the 
USGS  but  could  list  sources  of  other 
organi2ations  as  well. 

ISD  is  adding  features  required  for 
ESDD  (phrase  searching,  location 
searching,  and  key  word  searching 
within  fields),  which  can  be  accom- 
modated within  WAIS  and  the  Z39.50 
Standard,  ISD  is  also  including  the 
ability  for  a  user  of  the  client  software 
to  drop  from  a  WAIS  session  into  an 
automated  login  to  existing  data  sys- 
tems such  as  the  Global  Land  Infor- 
mation System.  With  this  approach, 
users  of  the  USGS/WAB  client  soft- 
ware would  be  able  to  access  any 
Z39.50  server  but  would  have  addi- 
tional capabilities  when  accessing  one 
of  the  USGS  servers.  A  subgroup  of 
the  USGS  Geographic  Data  Commit- 
tee is  also  exploring  the  possibility  of 
using  WAIS  to  access  a  clearinghouse 
of  USGS  spatial  data  holdings, 

The  following  examples  represent 
instances  where  USGS  data  and  infor- 
mation are  already  being  made  avail- 
able in  a  digital  form.  From  a  technical 
viewpoint,  it  would  be  a  small  step  to 
put  that  information  onto  a  WAIS 
server,  Note  that  until  user  authenti- 
cation is  included  in  Z39.50  (it  is  in  the 
1992  version  but  not  in  the  19S8  ver- 
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sion),  Information  servers  should  not 
include  restricted  information. 

•  Announcements — USGS  News  Re- 
leases, USGS  as  mentioned  in  the 
news,  schedule  of  upcoming 
events,  calls  for  papers,  training 
opportunity  announcements,  com- 
puter virus  alerts,  Requests  for 
Proposals, 

•  Catalogs  and  publications — USGS 
Library  Catalog,  USGS  publlca- 
tionsinduding  Circularsand  Open 
File  reports,  conference  proceed' 
ings,  abstracts  of  publications, 
newsletters,  technology  assess- 
ment reports,  Earth  Science  Infor- 
mation Center  product 
descriptions  and  ordering 
information. 

•  Data  accessed  as  information-— The 
fully  qualified  names  of  data  fields 
can  be  made  to  be  searchable  so 
that  a  user  could  have  the  equiva- 
lent of  a  Model  204  or  SQL  query 
without  having  to  learn  the  query 
language, 

•  Education— The  simplicity  of  the 
WAIS  user  interface  may  make  it  a 
good  tool  for  teaching  students 
how  to  navigate  the  world  of  scien- 
tific information. 

•  Electronic  filing— Personal, 
branch- wide,  or  office- wide  files  of 
outgoing  correspondence  can  be 
maintained  using  WAIS. 

•  Full-text  search  interface  for  CD- 
ROM's— WAIS  may  provide  a 
good  base  for  a  standard  user  inter- 
face for  text  searching  which  is 
being  pursued  by  the  community 
of  CD-ROM  developers  and  users. 

•  Graphics — Scientific  visualization 
product  demonstrations;  architec- 
tural drawings'  such  as  building 
and  floor  layouts  of  USG5  occu- 
pied space;  forms  distribution  for 
local  printing,  with  or  without  fill- 
in  software. 


•  Information  access  from  data— 
Textual  and  graphic  information 
can  be  linked  to  specific  data  fields 
in  a  data  base  (e.g.,  station  history 
linked  from  the  d  ata  val  ues  for  that 
station). 

•  Management  information — Strate- 
gic plans,  all-employee  letters,  se- 
curity guidelines. 

•  Manuals  and  handbooks— Haz- 
ardous materials  information  for 
laboratories,  users'  manuals  such 
as  mainframe  users'  manual  or 
help  files,  technical  manuals,  pro- 
cedures manuals  (e.g.,  quality  as- 
surance procedures),  Survey 
hAnnual,  guidelines  for  ADP  acqui- 
sition with  sample  requirements 
definition  documents. 

■  Personnel—Locator  information 
such  as  phone  directory,  organiza- 
tional directory,  electronic  mail  di- 
rectory; model  position 
descriptions  and  performance 
standards;  vacancy  announce- 
ments; personnel  regulations. 

•  Software— Directories  of  public 
domain  and  shareware,  including 
searching  of  program  descriptions 
and  source  code;  distribution  of 

"software. 

This  list  only  scratches  the  surface 
of  possible  applications,  and  WAIS 
may  well  not  be  the  most  appropriate 
vehicle  for  all  of  these.  There  is  also  a 
set  of  issues  to  be  addressed  concern- 
ing the  informal  release  of  digital  data 
and  information  that  might  be  mis- 
perceived  as  having  been  through  the 
formal  USGS  review  process. 

WAIS  and  Global  Change 

WAIS  is  being  used  for  the  en- 
hanced version  of  the  ESDD  which 
interfaces  with  the  interagency  Global 
Change  Master  Directory,  a  single 
source  for  references  to  key  global 
change  data.  The  global  change  data 
management  community  is  consider- 
ing WAIS  to  rapidly  correlate  the 
Global  Change  Master  Directory  to 


other  already  existingdata  directories 
relevant  to  global  change  research. 
For  example,  the  National  Oceanic 
and  Atmospheric  Administration  has 
a  directory  with  about  *5,000  data  set 
references,  and  the  Inter-University 
Consortium  for  Political  and  Social 
Research  has  another  directory  refer- 
encing about  23,000  data  sets.  This  ap- 
proach would  be  especially  useful  for 
the  very  broad  community  of  users 
intended  to  be  served  by  the  Consor- 
tium for  Internationa]  Earth  Science 
Information  Network  (CIESIN). 
WAIS  hasalso  been  suggested  for  the 
"Directory  of  Directories"  effort  tak- 
ing place  under  the  International 
Council  of  Scientific  Unions'  Commit- 
tee on  Data. 

The  ability  of  WAIS  to  handle  dif- 
ferent information  sources  through  a 
single  user  interface  makes  it  possible 
for  researchers  to  explore  publica- 
tions and  data  sets  concurrently.  The 
federal  research  libraries  involved  in 
global  change  research  (primarily, 
National  Aeronautics  and  Space  Ad- 
ministration, National  Oceanic  and 
Atmospheric  Administration,  U.S. 
Geological  Survey,  and  U.S.  Depart- 
ment of  Agriculture)  are  very  inter- 
ested in  the  potential  for  WAIS  to 
bridge  between  the  data  and  informa- 
tion worlds.  Also,  WAIS  is  seen  as  a 
useful  way  to  connect  textual  infor- 
mation into  a  data  system.  For  exam-, 
pie,  when  a  user  is  researching  an 
existing  data  set,  it  would  be  useful  to 
provide  immediate  access  to  all  of  the 
associated' documentation  about  that 
data  set.  Using  WAIS,  the  associated 
documentation  could  extend  beyond 
the  data  set  itself  to  include  publica- 
tions which  reference  the  data  set  or 
engineering  specifications  of  the  in- 
struments used. 

Because  WA15  simplifies  searching 
for  information  across  many  dimen- 
sions, it  would  be  a  natural  tool  for  use 
in  the  Global  Change  Research  Infor- 
mation Office.  This  new  office  is  man- 
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dated  by  the  Global  Change  Research 
Act  of  1990  to  provide  information 
internationally  on  global  change  re- 
search and  technologies  associated 
with  mitigating  or  adapting  to  global 
change. 

Software  Rtqulremtnts  and 
Support 

The  public  domain  WAIS  software 
is  written  in  C  and  has  been  ported  to 
a  variety  of  platforms.  The  client  soft- 
ware is  available  for  a  range  of  UNIX 
workstations  (including  Sunend  Data 
General)  in  the  X  Window  System, 
MS-DOS  in  either  Windows  3.0  (using 
Dynamic  Link  Libraries)  or  character 
mode,  the  Apple  Macintosh,  and  the 
NeXT  computers,  The  server  software 
runs  on  these  computers,  plus  DEC 
VMS,  IBM  VM,  and  the  Connection 
Machine,  among  others. 

WAIS  clients  and  servers  have 
been  implemented  without  commu- 
nications on  UNIX,  Macintosh,  and 
MS-DOS.  This  allows  WAIS  to  be 
used  also  to  access  data  and  informa- 
tion distributed  on  floppy  disks  or 
CD-ROM's.  Terminals  having  only 
asynchronous  dial-up  capabilities  can 
access  WAIS  by  telephone  connection 
to  a  client  running  on  a  UNIX  ma- 
chine. Any  WAIS  client  can  access 
"WAIS  servers  through  TCP/IP  soft- 
ware. TCP/IP  is  usually  included 
with  UNIX  workstations,  and  both 
MS-DOS  versions  (Windows  and 
non-Windows)  include  public  do- 
main TCP/IP  software. 


Indexing  of  text  to  create  an  infor- 
mation source  is  fairly  rapid:  A  30 
megabyte  file  was  indexed  in  about  20 
minutes  on  a  Data  General  AViiON. 
In  all  instances  explored  so  far,  search- 
ing sources  either  locally  or  on  the 
Internet  occurs  in  a  matter  of  seconds. 

WAIS  Support 

In  pursuing  WAIS  in  the  USGS,  1SD 
will  be  working  closely  with  the 
broader  WAIS  community,  which 
includes  about  150  universities  and  a 
number  of  major  corporations  (Apple, 
Sun,  NeXT,  DEC,  Microsoft,  Dow 
Jones,  Peat  Marwick,  Mead  Data  Cen- 
tral, and  Thinking  Machines,  among 
others).  The  overall  leader  of  the  proj* 
ect  thus  far  has  been  Thinking 
Machines,  lnc,  The  University  of 
North  Carolina  has  proposed  to  the 
National  Science  Foundation  to  be- 
come the  focus  for  long-term  support 
of  WAIS  In  a  manner  similar  to  the 
public  domain  Kermit  software  sup- 
port provided  by  Columbia  Univer- 
sity. Those  heavily  involved  in  WAIS 
development  are  working  very 
closely  with  NISO  as  well. 

Conclusion 

WAIS  represents  more  than  just 
another  slick  new  piece  of  software. 
Although  it  is  based  on  the  Z39.50 
protocol  bom  in  the  library  science 
and  information  services  community, 
WAIS  bridges  to  the  computer  science 
and  data  processing  services  commu- 
nity. Such  bridges  hold  out  the  prom- 


ise of  revolutionary  improvements  in 
information  services  and  in  how  in- 
formation is  handled  in  data  process- 
ing systems. 

The  capabilities  demonstrated  by 
the  239.50  standard  and  the  WAIS  im- 
plementation  challenge  us  in  the 
USGS  to  examine  how  we  think  of 
access  to  data  and  information  and 
how  we  make  public  USGS  data  and 
information. 

Further  Information 

Within  the  USGS,  Tim  Gauslin  of 
1SD  is  currently  the  focal  point  for 
WAIS  development  and  applications 
and  can  provide  some  information  on 
the  NISO  Z39.50  standard.  Because 
Tim  personally  ported  the  WAIS  soft- 
ware  to  MS-DOS  and  the  Data 
General,  he  should  be  contacted  for 
those  versions.  Tim  should  also  be 
consulted  if  there  is  interest  in  bring- 
ing up  USGS  information  servers  and 
sources  in  addition  to  ESDD.  Tim  may 
be  contacted  at; 

U.S.  Geological  Survey 
802  National  Center 
Reston,  Virginia 
703-648-5980 
tgauslin@isdres.er.usgs.gov 

Information  on  WAIS  from  Think- 
ing Machines  is  available  from 
Barbara  Lincoln  (TCP/IP: 
barbara<Sthink.com),  as  well  as 
Brewster  Kahle  of  Thinking  Machines 
(TCP/IP:  brewster®trunk.com).  ■ 
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